Picture for Andrew Wang

Andrew Wang

DeepSearchQA: Bridging the Comprehensiveness Gap for Deep Research Agents

Add code
Jan 28, 2026
Viaarxiv icon

NEMO-4-PAYPAL: Leveraging NVIDIA's Nemo Framework for empowering PayPal's Commerce Agent

Add code
Dec 25, 2025
Viaarxiv icon

The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality

Add code
Dec 11, 2025
Viaarxiv icon

Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback

Add code
Jun 13, 2025
Viaarxiv icon

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Add code
Jun 05, 2025
Viaarxiv icon

DeepInverse: A Python package for solving imaging inverse problems with deep learning

Add code
May 26, 2025
Viaarxiv icon

Learning Extrapolative Sequence Transformations from Markov Chains

Add code
May 26, 2025
Viaarxiv icon

El Agente: An Autonomous Agent for Quantum Chemistry

Add code
May 05, 2025
Viaarxiv icon

Llama-Nemotron: Efficient Reasoning Models

Add code
May 02, 2025
Figure 1 for Llama-Nemotron: Efficient Reasoning Models
Figure 2 for Llama-Nemotron: Efficient Reasoning Models
Figure 3 for Llama-Nemotron: Efficient Reasoning Models
Figure 4 for Llama-Nemotron: Efficient Reasoning Models
Viaarxiv icon

Benchmarking Self-Supervised Methods for Accelerated MRI Reconstruction

Add code
Feb 23, 2025
Viaarxiv icon